Simultaneous Clustering and Feature Ranking by Competitive Repetition Suppression Learning with Application to Gene Data Analysis

نویسندگان

Davide Bacciu

Alessio Micheli

Antonina Starita

چکیده

The paper presents feature-wise Competitive Repetition-suppression (CoRe) clustering, a novel unsupervised algorithm that deals with the automatic determination of the unknown cluster number and simultaneous feature ranking. The proposed model addresses the limitations of the original CoRe learning algorithm when dealing with high dimensional data, extending the repetition suppression competition on a feature-wise basis. The effectiveness of the approach is tested on gene expression data from DNA microarrays: the results show that the feature-wise CoRe clustering algorithm is able to detect the known data partitioning in a completely unsupervised fashion. Moreover, it simultaneously develops a gene ranking that is consistent with the state-of-the-art list of gene relevance for the selected benchmark datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature-wise Competitive Repetition Suppression Learning for Gene Data Clustering and Feature Ranking

The paper extends Competitive Repetition-suppression (CoRe) learning to deal with high dimensional data clustering. We show how CoRe can be applied to the automatic detection of the unknown cluster number and the simultaneous ranking of the features according to learned relevance factors. The effectiveness of the approach is tested on two freely available data sets from gene expression data and...

متن کامل

A Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data

The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...

متن کامل

Supervised Clustering of Label Ranking Data

In this paper we study supervised clustering in the context of label ranking data. Segmentation of such complex data has many potential real-world applications. For example, in target marketing, the goal is to cluster customers in the feature space by taking into consideration the assigned, potentially incomplete product preferences, such that the preferences of instances within a cluster are m...

متن کامل

MOSCFRA: A Multi-objective Genetic Approach for Simultaneous Clustering and Gene Ranking

Microarray experiments generate a large amount of data which is used to discover the genetic background of diseases and to know the gene characteristics. Clustering the tissue samples is an important tool for partitioning the dataset according to co-expression patterns. This clustering task is even more difficult when we try to find the rank of each gene (Gene Ranking) according to their abilit...

متن کامل

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2007

Simultaneous Clustering and Feature Ranking by Competitive Repetition Suppression Learning with Application to Gene Data Analysis

نویسندگان

چکیده

منابع مشابه

Feature-wise Competitive Repetition Suppression Learning for Gene Data Clustering and Feature Ranking

A Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data

Supervised Clustering of Label Ranking Data

MOSCFRA: A Multi-objective Genetic Approach for Simultaneous Clustering and Gene Ranking

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

عنوان ژورنال:

اشتراک گذاری